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Method of colo r image pro cessing to eliminate shadows and reflections 

' BACKGROUND OF THeInVENTION 

The present invention relates to a method for the processing of 
color images to eliminate shadows. It can be applied especially to the 
heightening of contrast in remote surveillance, and to image processing 
5 before segmentation in mobile robotics. 

In applications of mobile robotics, operations of image 
segmentation are carried out. Segmentation makes it possible to detect and 
localize the edges of roads for example. The shadows of trees on these 
roads or reflections from puddles generate parasitic segments. These 
10 shadows and reflections constitute parasitic information. The invention 
makes it possible to eliminate this parasitic information by carrying out a 
processing of color images that are to be segmented. 

SUMMARY OF THE INVENTION 

An object of the invention is a method of color image processing, 
15 comprising the following steps: 

- the color image is converted into an intermediate image comprising 
components that depend solely on the H and S components of the 
original image in an HSV or HLS representation; 

- the intermediate image comprising at least two components X and Y, 
20 determined by functions especially of the H component, these 

functions taking the same value when the H component is zero or 
equal to one; 

- a new image comprising only one component is generated from the 
intermediate image, the component of this new image being a function 

25 of the components of the intermediate image. 

The method according to the invention has many advantages: 

- it can be used to extract the useful information from a color image with 
a minimum loss of information; 

- it can be used to compress the information in a monochromatic image; 
30 - it is totally non-parametrical and can therefore be applied to all images 

with the same efficiency, without any particular setting, whatever the 
scene. 

After a processing operation of this kind, it is possible to use 
monochromatic image segmentation algorithms while at the same time 
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exploiting the wealth of information of a color image in getting rid of errors 
due to shadows or reflections of water films. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The Invention shall now be described in great deal in the context 
5 of a particular example of a practical implementation. In this description, 
reference shall be made to the figures of the appended drawings, of which: 

- Figure 1 represents the RGB color space; 

- Figure 2 represents the HSV color space; 

- Figure 3 is a graph showing the steps of implementation of the 
10 invention according to an advantageous mode using texture attributes; 

- Figure 4 exemplifies a result of a Karhunen-Lo6ve transformation 
operation; 

- Figure 5 shows an altemative implementation of the example shown in 
figure 3; 

15 - Figure 6 is an exemplary histogram of a new image obtained by the 
implementation of the invention to which a filtering of pixels and an 
adjustment of the dynamic range is applied; 

- Figure 7 shows another exemplary adjustment of the dynamic range. 

MORE DETAILED DESCRIPTION 

20 Referring to figure 1 , a color image is generally described in the 

RGB (red, green, blue) space. The color of each pixel is represented therein 
by three components R (red), G (green), B (blue). These components may 
be represented on three axes 11, 12, 13 in Cartesian coordinates. The 
primary colors, cyan, blue, magenta, red, yellow and green, form the comers 

25 of a cube 10 in this space. Black and white form opposite corners of this 
cube. The segment connecting these points is a main diagonal of this cube 
on which only the luminance L varies. 

Referring now to figure 2, other color spaces exist. In the HSV 
(hue, saturation, value) space, color is represented by three components H, 

30 S, and V. These components may be represented in a referential system of 
cylindrical coordinates. The H component is represented by an angle 21. 
The S and V components are represented on two axes 22 and 23. The H 
component is the hue. This component makes it possible for example to 
distinguish red and yellow. The S component is saturation, namely the purity 

35 of the color. This component makes it possible for example to distinguish red 
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and pink. The V component is the value or intensity, namely the quantity of 
light. This component makes it possible, for example, to distinguish between 
dark gray and light gray, or again between dark red and light red. It is 
assumed hereinafter in the description that the components H, S, V are 
5 normalized between 0 and 1. The following explanations may be transposed 
in an obvious way if these components are not normalized between 0 and 1 . 

The passage from the RGB representation to the HSV 
representation is done according to known analytical formulae. The primary 
colors and black form a six-faced pseudo-cone (hexcone) 20. The plane with 

10 a constant value of V corresponds to the projection of the RGB cube parallel 
to its main diagonal. 

The Applicant has noted that the value or intensity V does not 
carry a great deal of useful information because the elements of the image to 
be segmented are distinguished far more by their spectral properties than by 

15 their reflection or their total absorption. Furthermore, the value or intensity V 
carries information that leads the segmentation algorithms into error, since 
shadowed and non-shadowed zones of a same object (a road for example) 
have very different intensities (with variation of intensity at the source). 
Similarly, a water film on the road gives rise to a strong reflection and, 

20 therefore, a very different intensity from that of a dry or simply wet portion on 
the same road (with variation of intensity by modification of the properties of 
reflection). 

In order to eliminate the infomnation on value or intensity, the color 
image is converted into an intermediate image in which the V component is 
25 absent. This intermediate image includes components that depend solely on 
the H and S components of the original image in an HSV representation. 

This makes it possible to eliminate cast shadows and reflections 
due to water films. This furthermore has the advantage of reducing the 
quantity of information present in the image (by the elimination of a 
30 component). 

The conversion from the RGB space to the HSV space gives rise 
to singularities of hue for certain points. Thus, for the achromatic points 
(R=G=B), a small variation in RGB may lead to a big variation in H. 
Furthermore, for the points in the vicinity of the axis of origin of H (G=B, 
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R>(G+B)/2). a small disturbance In the components R, G, B may lead to 
major variations In H (a change in the sign of H). 

Furthermore, the distribution of the converted values is not 
uniform. Digitization may lead to peaks and holes in the distribution of the 
5 values. In other words, the information is noise-ridden around V=0. 

Advantageously, components X and Y are used in the 
intermediate image. These components are determined by functions 
especially of the H component, and take the same value when the H 
component is zero or equal to one. Thus, the similarities Induced by the H 
10 component, proper to the system of cylindrical coordinates, are eliminated. 

These functions are written as Gj^{h) and Gy{H). They therefore 
verify the following relationships: Gj^{o)=G^{l) and Gy(o) = Gy(l). The 
components X and Y are then determined by: 

x = gAh) 

15 Y = Gy{H) 

For example, the following functions may be used: 

G;^{H) = cos{27tH-(p) 
Gy{H)^sm{27cH-(p) 

where is a constant. 
20 A third component can be added to this intermediate image: 

saturation S. Thus, in the intennediate image, there is hue information H and 
saturation information S, while at the same time the singularities have been 
eliminated. 

Advantageously, it is possible to combine hue information H and 
25 saturation information S in only two components of the intermediate image. 
To this end, the components X and Y are determined by functions not only of 
the H component but also of the S component. In the same way as above, 
these functions take the same value when the H component is zero or equal 
to one. Furthermore, these functions tend toward zero when the S 
30 component tends toward zero. 

These functions are written as G]^{h,S) and G'y{H,S). They 

therefore verify the following relationships: 

G'A0,S)=G'AhS) 

G'y{0,S)=Gy{l,S) 
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G'Afi.S) ^ 0 

S ^0 
Gy{H,S) 0 

S ^0 

For example, it is possible to use a product of two independent 
functions respectively of the S component and of the H component: 

Gy{H,S) = Fy{s)xGy{H) 
The functions Fj^{s), Fy{s) of the S component are functions 

tending toward zero when the S component tends toward zero. These 
functions may be, for example, the identity function (for example F;^{s)=S) 

10 or the square function (for example Fj^{s) = S^). Preferably, monotonic and 
continuous functions will be used. 

Referring now to figure 3, in an advantageous variant, an 
intermediate image 34 is generated. This intermediate image furthermore 
comprises texture attributes. The texture attributes are determined from an 

15 image 32 comprising the components X and Y for example, and not from the 
initial color image 30 encoded in the RGB space. Indeed, it is sought first of 
all to eliminate the shadows and the reflections. 

For example the initial color image 30 is converted into an image 
31 with two components, the H and S components. Then, the image 31 is 

20 converted into an image 32 that comprises the two components X and Y 
defined here above. For each pixel of the image 32, texture attributes are 
determined. These attributes may be, for example, the attributes defined by 
Haralick. Of course, other attributes may be used, through a fractal approach 
for example. An image 33 is obtained with texture attributes. The 

25 components X and Y and the texture attributes are combined to form the 
intermediate image 34, to which the remaining processing operations are 
applied. It is thus possible to apply a multitude of transformations to the 
intermediate image without being preoccupied by shadows and reflections. 

In order to obtain a monochromatic image, a new image is 

30 generated from the intermediate image obtained. In other words, a 
transfomnation is applied to the intermediate image so as to keep only one 
component. The single component of the new image is a function of the 
components of the intermediate image. 
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Referring now to figure 4, in an advantageous embodiment of the 
invention, the new image is generated by keeping only the first component of 
the Karhunen-Loeve transformation. 

This transformation is applied in the referential system D1, D2 of 
5 the eigen vectors are associated with the covariance matrix of the pixels of 
the intermediate image. There is thus a passage from the X, Y referential 
system to the D1, D2 referential system. As can be seen in the exemplary 
distribution of the pixels 40 of the image in these referential systems, this 
transformation enables the decorrelation of the information present in the 
10 image. The first component D1 is the most discriminatory component, i.e. it 
is this component that maximizes the information. This facilitates 
discrimination in the segmentation processing operations applied to the new 
image. 

According to an alternative mode of implementation, a linear 
15 approximation of the Karhunen-Loeve transformation is applied, not the 
Karhunen-Loeve transformation itself. To this end, the Karhunen-Loeve 
transformation is applied to a representative set of images (for example eight 
images). The images of this set have characteristics similar to the images to 
which it is sought to apply the linear approximation. An average is then taken 
20 of the changes in referential system obtained by the exact Karhunen-Loeve 
transformation. A average change of referential system is obtained, and this 
referential will be used as an approximation of the Karhunen-Lo^ve 
transformation. 

According to another mode of implementation, the new image is 
25 generated by projecting the components of the intermediate image in the 
plane in which the dynamic range or the mean standard deviation is the 
greatest. 

Reference is now made to figure 5 which represents an alternative 
mode of implementation with respect to the example shown in figure 3. In 

30 the same way, from the initial color image 50, an image with two 
components, the H and S components, is generated. Then an image 52 
comprising the two components X and Y is generated. Then the Karhunen- 
Lofeve transformation (or a linear approximation of this transformation) is 
applied to the components X and Y. The first component of the result of this 

35 transformation is kept to form an image 53 comprising a single component. 
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Texture attributes of this image 53 are determined. These texture attributes 
form an image 54. The single component of the image 53 and the texture 
attributes (image 54) are combined to form the intermediate image 55 
comprising color information and texture information. The remaining 
5 processing operations are applied to this intermediate image 55. 

The Karhunen-Loeve transformation can thus be applied a second 
time, for example to obtain a monochromatic image. 

Reference is now made to figure 6 which represents an exemplary 
histogram 60 of the component of the monochromatic image. The x-axis 61 

10 represents the values taken by the pixels. The y-axis 62 represents the 
number of pixels in the image having a determined value. A non-negligible 
range of the values is taken up by pixels that are very small in number at the 
two extreme ends 63a, 63b of the histogram. 

Advantageously the lightest and the darkest pixels, which are 

15 present in very small numbers, are filtered. This makes it possible to avoid 
any loss of dynamic range through the occupying of a major part of the range 
of values by pixels that carry no additional information. In other words, the 
darkest and the lightest pixels, which represent a determined fraction of the 
total number of pixels of the image, are filtered. To this end, a number of 

20 pixels to be eliminated from the image is determined. This number of pixels 
may be expressed as a percentage of the total number of pixels of the image. 
A minimum value VMIN and a maximum value VMAX of the pixels are 
determined, so that the number of pixels of the image whose value is below 
VMIN or above VMAX is as close as possible to the number of pixels to be 

25 eliminated. For all the pixels whose value is beyond the values VMIN and 
VMAX, the value of the corresponding limit is assigned. 

The values VMIN and VMAX may be determined iteratively. 
During an iteration, a value 64 of the y-axis 62 is selected in going through 
the axis from the bottom to the top. The smallest value VMIN and the 

30 greatest value VMAX of the y-axis 61 , corresponding to the intersections 64a, 
64b of the histogram 60 with the selected value 64, are determined. The 
operation is stopped when the number of pixels to be eliminated for the 
selected value is greater than the determined fraction of the total number of 
pixels of the image. 
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The values taken by the component of the monochromatic image 
(the most discriminatory component) are distributed in a non-normalized 
range. In preparation for subsequent processing operations (such as 
segmentation, viewing, etc.), these values may be brought by linear 
5 transformation into a normalized interval, for example [0.1], or [0.255 ]. This 
linear transformation 65 may be applied, as illustrated in figure 6, after the 
pixels have been filtered in small numbers at the two limits of the histogram. 
Thus, for an image encoded on 8 bits, the value VMIN is brought to a level 
zero, and the value VMAX is brought to a level 255. 

10 Referring now to figure 7, an advantageous variant is described of 

the adjustment of the values used to process the image by giving greater or 
lesser importance to the contrast in a selected zone. 

In this advantageous variant, the beginning of the processing is 
carried out in a similar way. The parameters of the Karhunen-Loeve 

15 transformation (or a linear approximation of this transformation) are 
determined according to the selected zone of the image only. This 
transformation is then applied to the entire image. The pixels of the selected 
zone occupy only a part 70 of the histogram 60, 

Similarly, a linear transformation can be applied to bring the values 

20 of the pixels into a normalized interval. Advantageously, the parameters of 
the linear transformation are determined so as to bring the minimum value 
VMIN and the maximum value VMAX of the selected zone between two 
predetemnined levels NMIN and NMAX. These levels NMIN and NMAX are 
strictly contained in the normalized interval. For example, for an image 

25 encoded on 8 bits, the normalized interval being [0.255 ], a value of 32 can 
be taken for NMIN and 224 for NMAX. 

These parameters are used to apply a linear transformation to the 
entire image, the levels of the pixels being limited (for pixels outside the 
selected zone) in order to be included in the normalized interval. 

30 This advantageous variant is used to obtain efficient discrimination 

in the regions similar to the selected zone, while at the same time keeping 
other information out of this selected zone. 



